Background of the Invention 
This application is a continuation-in-part of U.S. Patent
Application of Alizon et al. for "Cloned DNA Sequences Related to
the Entire Genomic RNA of Human Immunodeficiency Virus II (HIV-2),
Polypeptides Encoded by these DNA Sequences and Use of these DNA
Clones and Polypeptides in Diagnostic Kits," filed January 16,
1987, which is a continuation-in-part of U.S. Patent Application
Serial No. 931,866 filed November 21, 1986, which is a
continuation-in-part application of U.S. Patent Application Serial
No. 916,080 of Montagnier et al. for "Cloned DNA Sequences Related
to the Genomic RNA of the Human Immunodeficiency Virus II (HIV-2),
Polypeptides Encoded by these DNA Sequences and Use of these DNA
Clones and Polypeptides in Diagnostic Kits," filed October 6, 1986
and U.S. Patent Application Serial No. 835,228 of Montagnier
et al. for "New Retrovirus Capable of Causing AIDS, Antigens
Obtained from this Retrovirus and Corresponding Antibodies and
their Application for Diagnostic Purposes," filed March 3, 198*.
The disclosures of each of these predecessor applications are 
expressly incorporated herein by reference. 

The invention relates to cloned DNA sequences analogous to 
the genomic RNA of a virus known as Lymphadenopathy-Associated 
Virus II ("LAV-II"), a process for the preparation of these
cloned DNA sequences, and their use as probes in diagnostic kits. 
In one embodiment, the invention relates to a cloned DNA sequence 
analogous to the entire genomic RNA of HIV-2 and its use as a 

... were detected and plaque purified. Of these phages, three were
characterized by restriction mapping and Southern blot
hybridization with the E2 insert and probes from its 3' end (LTR)
or 5' end (envelope), as well as with HIV-1 subgenomic probes.
In this instance, HIV-1 probes were used under non-stringent
conditions.

A clone carrying a 9.5 kb insert and derived from a circular
viral DNA was identified as containing the complete genome and
designated λROD 4. Two other clones, λROD 27 and λROD 35, were
derived from integrated proviruses and found to carry an LTR,
cellular flanking sequences and a portion of the viral coding
sequences, as shown in Figure 3, A.

Fragments of the lambda clones were subcloned into the plasmid
vector pUC18.

Plasmid pROD 27-5' is derived from λROD 27 and contains the
5' 2 kb of the HIV-2 genome and cellular flanking sequences (5'
LTR and 5' viral coding sequences to the EcoRI site).

Plasmid pROD 4.8 is derived from λROD 4 and contains the
approximately 5 kb HindIII fragment that is the central part of
the HIV-2 genome.

The inserts of plasmids pROD 27-5' and pROD 4.8 overlap.

Plasmid pROD 4.7 contains a 1.8 kb HindIII fragment from
λROD 4. This fragment is located 3' to the fragment subcloned
into pROD 4.8 and contains about 0.8 kb of viral coding sequences
and the part of the lambda phage (λL47.1) left arm located
between the BamHI and HindIII cloning sites.



Plasmid pROD 35 contains all the HIV-2 coding sequences 3'
to the EcoRI site, the 3' LTR and about 4 kb of cellular flanking
sequences.

Plasmids pROD 27-5' and pROD 35 in E. coli strain HB 101 are
deposited respectively under No. I-626 and I-633 at the CNCM, and
have also been deposited at the NCIB (British Collection). These
plasmids are depicted in Figure 5. Plasmids pROD 4.7 and pROD
4.8 in E. coli strain TG1 are deposited respectively under
No. I-627 and I-628 at the CNCM.

To reconstitute the complete HIV-2 ROD genome, pROD 35 is
linearized with EcoRI and the EcoRI insert of pROD 27-5' is
ligated in the correct orientation into this site.
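
The cut-and-ligate logic of this step can be illustrated with a
minimal Python sketch. It is not the procedure's actual data: the
two sequences are short made-up placeholders standing in for
pROD 35 and pROD 27-5', the donor is modelled as a linear string,
and all function names and the MARKER motif are hypothetical. Only
the EcoRI recognition site (GAATTC, cut after the G) is standard.
The sketch shows why orientation has to be checked: an EcoRI
fragment has the same cohesive end on both sides, so it can ligate
either way round, and both products regenerate two EcoRI sites.

```python
ECORI_SITE = "GAATTC"  # EcoRI cuts G^AATTC, leaving 5'-AATT overhangs
_COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

# Toy placeholders (assumptions), NOT real plasmid sequences.
VECTOR = "TTGACCGAATTCGGATCCAAGT"        # circular, one EcoRI site
DONOR = "ACGTGAATTCATGCCCTTTGAATTCTTAA"  # insert flanked by two EcoRI sites
MARKER = "ATGCCC"                        # hypothetical motif marking the wanted strand


def reverse_complement(seq):
    return "".join(_COMPLEMENT[base] for base in reversed(seq))


def ecori_cut_positions(seq):
    """0-based top-strand cut positions (just after the G of each GAATTC)."""
    cuts, start = [], seq.find(ECORI_SITE)
    while start != -1:
        cuts.append(start + 1)
        start = seq.find(ECORI_SITE, start + 1)
    return cuts


def linearize(circular):
    """Open a circular plasmid at its single EcoRI site."""
    (cut,) = ecori_cut_positions(circular)  # raises unless exactly one site
    return circular[cut:] + circular[:cut]


def excise_insert(donor):
    """Return the top strand of the fragment between the two EcoRI cuts."""
    first, second = ecori_cut_positions(donor)
    return donor[first:second]


def flip(fragment):
    """Top strand of the same fragment in the opposite orientation."""
    return "AATT" + reverse_complement(fragment[4:])


def circular_site_count(circular):
    """Count EcoRI sites in a circular sequence, including one across the origin."""
    return len(ecori_cut_positions(circular + circular[: len(ECORI_SITE) - 1]))


vector_linear = linearize(VECTOR)
insert = excise_insert(DONOR)
forward = vector_linear + insert        # one ligation product
reverse = vector_linear + flip(insert)  # the other possible product

for name, construct in (("forward", forward), ("reverse", reverse)):
    print(name, circular_site_count(construct), MARKER in construct)
```

Both constructs carry two EcoRI sites, so site counting alone
cannot distinguish them; in this toy model only the presence of
the marker motif on the top strand identifies the wanted
orientation.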

The relationship of HIV-2 to other human and simian retroviruses
was surmised from hybridization experiments. The relative
homology of the different regions of the HIV-1 and HIV-2 genomes
was determined by hybridization of fragments of the cloned HIV-1
genome with the labelled λROD 4, expected to contain the complete
HIV-2 genome (Figure 3, B). Even in very low stringency
conditions (Tm-42°C), the hybridization of HIV-1 and HIV-2 was
restricted to a fraction of their genomes, principally the gag
gene (dots 1 and 2), the reverse transcriptase domain in pol
(dot 3), the end of pol and the Q (or sor) genes (dot 5) and the
F gene (or 3' orf) and 3' LTR (dot 11). The HIV-1 fragment used
to detect the HIV-2 cDNA clones contained the dot 11 subclone,
which hybridized well to HIV-2 under non-stringent conditions.
Only the signal from dot 5 persisted after stringent washing. The
envelope gene,

Experimental analysis of the HIV-2 ROD isolate yielded the
following sequence, which represents the complete genome of this
HIV-2 isolate. Genes and major expression products identified
within the following sequence are indicated by the nucleotide
numbers given below:

1) GAG gene (546-2111) expresses a protein product having
a molecular weight of around 55 kd and is cleaved into the
following proteins:

a) p16 (546-950)

b) p26 (951-1640)

c) p12 (1701-2111)

2) polymerase (1829-4936)

3) Q protein (4869-5513)

4) R protein (5682-5996)

5) X protein (5344-5679)

6) Y protein (5682-5996)

7) Env protein (6147-8720)

8) F protein (8557-9324)

9) TAT gene (5845-6140 and 8307-8400) is expressed by two
exons separated by introns.

10) ART protein (6071-6140 and 8307-8536) is similarly the
expression product of two exons.

11) LTR: R (1-173 and 9498-9671)
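
As a simple consistency check on the coordinates listed above, the
following Python sketch computes the length of each feature and
tests whether it is a whole number of codons. The coordinates are
transcribed from the list (read as 1-based and inclusive, which is
an assumption); the dictionary layout and function names are
hypothetical. Whether each range includes a stop codon is not
stated in the list, so the codon counts are indicative only.

```python
# Coordinates transcribed from the numbered list above
# (assumed 1-based, inclusive). R and Y are listed with identical
# coordinates in the text as transcribed; they are copied unchanged.
FEATURES = {
    "gag": [(546, 2111)],
    "p16": [(546, 950)],
    "p26": [(951, 1640)],
    "p12": [(1701, 2111)],
    "pol": [(1829, 4936)],
    "Q": [(4869, 5513)],
    "R": [(5682, 5996)],
    "X": [(5344, 5679)],
    "Y": [(5682, 5996)],
    "env": [(6147, 8720)],
    "F": [(8557, 9324)],
    "tat": [(5845, 6140), (8307, 8400)],  # two exons
    "art": [(6071, 6140), (8307, 8536)],  # two exons
}


def length_nt(segments):
    """Total length in nucleotides over all segments of a feature."""
    return sum(end - start + 1 for start, end in segments)


def overlap_nt(a, b):
    """Nucleotides shared by two single-segment features (0 if disjoint)."""
    (a_start, a_end), (b_start, b_end) = a[0], b[0]
    return max(0, min(a_end, b_end) - max(a_start, b_start) + 1)


for name, segments in FEATURES.items():
    nt = length_nt(segments)
    codons, remainder = divmod(nt, 3)
    print(f"{name:>4}: {nt:5d} nt, {codons:4d} codons, remainder {remainder}")

print("gag/pol overlap:", overlap_nt(FEATURES["gag"], FEATURES["pol"]), "nt")
```

Every listed range, including the summed exons of TAT and ART,
comes out as a multiple of three nucleotides, and the GAG and
polymerase ranges share 283 nucleotides.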

env10 (1132-1215)



Ly«ei7fflrA*pFroCla?«UlAT7TM«tTrpThrAit 
AAAeCCTCACACCCACAACTAOCATACATOTCCACTAAC 

• • • • 

C7»Ar|CL701urh«L«ut7rC7*A«aM«tTbrTr»Pb«La«Ai« 
TCCACACOAOAOTTTCrCTACTCCAACAtCACTrOOTtCCTCAAt 

1200 



env11 (1237-1305)

ArgAtoT7ffAUrroC7«Ii9ll« 
CCCAATTAtCCACCCTCCCAtAtA 

LyiClotl.U.A«nTbrtrpIiiL y »V.10l yA r l AiBV»ll7r 
A AC CAA ATAATT AACAC ATCC CATAA06TAC6 OAQAAATOIATAT 

1300 



gag1 (991-1053)

AtpCyiL7iL««VtlL«v»t7»Cl7LtuCl7««tA»oProTbrt«« 
CACT0TAAATTACT6CTAAAAC6ACTACC6AT0AACCCZACCTTA 

1000 . . 

GluGluMetLeuThrAla
GAAGAGATGCTGACCGCC
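
The peptide listings above pair a line of three-letter amino acid
codes with the line of codons that encodes it. The following
Python sketch shows how the two lines relate, using the standard
genetic code. The input sequence is a made-up example, not taken
from the HIV-2 sequence, and the function names are hypothetical.
The sketch also converts a peptide's nucleotide range into a
residue count, reading the ranges as inclusive (an assumption).

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "***",  # stop codon
}
CODON_TABLE = {
    b1 + b2 + b3: THREE_LETTER[AMINO_ACIDS[16 * i + 4 * j + k]]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame DNA string into three-letter amino acid codes."""
    if len(dna) % 3:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def residue_count(start, end):
    """Number of codons in an inclusive nucleotide range."""
    return (end - start + 1) // 3


toy_dna = "ATGGAAGAGCTGACC"  # hypothetical example sequence
print(translate(toy_dna))    # amino acid line
print(toy_dna)               # nucleotide line printed beneath it

for name, (start, end) in {"env10": (1132, 1215),
                           "env11": (1237, 1305),
                           "gag1": (991, 1053)}.items():
    print(name, residue_count(start, end), "residues")
```

Because each amino acid line must agree with the codons beneath
it, a translation of this kind can also serve as a cross-check
when a listing is hard to read.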



Of the foregoing peptides, env1, env2, env3 and gag1 are
particularly contemplated for diagnostic purposes, and env4,
env5, env6, env7, env8, env9, env10 and env11 are particularly
contemplated as protecting agents. These peptides have been
selected in part because of their sequence homology to certain of
the envelope and gag protein products of other of the
retroviruses in the HIV group. For vaccinating purposes, the
foregoing peptides may be coupled to a carrier protein by utilizing



